Monocular depth estimation is a challenging task in complex compositions depicting multiple objects of diverse scales. Despite the recent great progress thanks to deep convolutional neural networks (CNNs), state-of-the-art monocular depth estimation methods still fall short in handling such challenging real-world scenarios. In this paper, we propose a deep end-to-end learning framework to tackle these challenges, which learns the direct mapping from a color image to the corresponding depth map. First, we formulate monocular depth estimation as a multi-category dense labeling task, in contrast to the regression-based formulation. In this way, we can build upon the recent progress in dense labeling tasks such as semantic segmentation. Second, we fuse different side-outputs from our front-end dilated convolutional neural network in a hierarchical way to exploit multi-scale depth cues, which is critical for scale-aware depth estimation. Third, we propose to use soft-weighted-sum inference instead of hard-max inference, transforming discretized depth scores into continuous depth values. This reduces the influence of quantization error and improves the robustness of our method. Extensive experiments on the NYU Depth V2 and KITTI datasets show the superiority of our method over current state-of-the-art methods. Furthermore, experiments on the NYU Depth V2 dataset reveal that our model is able to learn the probability distribution of depth.
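As a concrete illustration of the soft-weighted-sum inference idea, the minimal PyTorch sketch below converts per-pixel scores over discretized depth bins into a continuous depth map: a softmax over the bin axis gives a per-pixel probability distribution, and depth is recovered as the probability-weighted sum of the bin centers rather than the arg-max bin. The log-space bin centers, the depth range, and the function name are illustrative assumptions, not the paper's exact configuration.

```python
import math
import torch
import torch.nn.functional as F

def soft_weighted_sum_depth(scores, d_min=0.7, d_max=10.0):
    """Soft-weighted-sum inference (sketch).

    scores: (B, K, H, W) per-pixel logits over K discretized depth bins.
    Returns a (B, H, W) continuous depth map.
    """
    b, k, h, w = scores.shape
    # Bin centers uniformly spaced in log-depth (assumed discretization scheme)
    centers = torch.exp(torch.linspace(math.log(d_min), math.log(d_max), k))
    # Per-pixel probability distribution over depth bins
    probs = F.softmax(scores, dim=1)
    # Continuous depth as the expectation of the bin centers under this distribution
    depth = (probs * centers.view(1, k, 1, 1)).sum(dim=1)
    return depth
```

Compared with taking the highest-scoring bin (hard-max), this expectation-style readout can fall between bin centers, which is how the quantization error introduced by discretizing depth is reduced.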